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ABSTRACT 

With the rich set of embedded sensors installed in smart¬ 
phones and the large number of mobile users, we witness 
the emergence of many innovative commercial mobile crowd¬ 
sensing applications that combine the power of mobile tech¬ 
nology with crowdsourcing to deliver time-sensitive and location- 
dependent information to their customers. Motivated by 
these real-world applications, we consider the task selection 
problem for heterogeneous users with different initial loca¬ 
tions, movement costs, movement speeds, and reputation 
levels. Computing the social surplus maximization task al¬ 
location turns out to be an NP-hard problem. Hence we 
focus on the distributed case, and propose an asynchronous 
and distributed task selection (ADTS) algorithm to help the 
users plan their task selections on their own. We prove the 
convergence of the algorithm, and further characterize the 
computation time for users’ updates in the algorithm. Sim¬ 
ulation results suggest that the ADTS scheme achieves the 
highest Jain’s fairness index and coverage comparing with 
several benchmark algorithms, while yielding similar user 
payoff to a greedy centralized benchmark. Finally, we illus¬ 
trate how mobile users coordinate under the ADTS scheme 
based on some practical movement time data derived from 
Google Maps. 

Categories and Subject Descriptors 

C.2.4 [Computer-Communication Networks]: Distributed 
Systems— Distributed applications 
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1. INTRODUCTION 

Today’s smartphones and wearable devices include a rich 
set of embedded sensors, such as cameras, microphones, global 
positioning systems (GPS), thermometers, and accelerome¬ 
ters muni- Thanks to the large number of mobile users and 
their inherent mobility, we are witnessing the rise of mobile 
crowdsensing (MCS), where individuals use their mobile de¬ 
vices to collectively extract and share information related to 
some phenomenon of interest. Applications of MCS include 
traffic jam alerts, wireless indoor localization, and small cell 
network monitoring. 

Recently, commercial MCS platforms, such as Gigwalk [6] 
and Field Agent [3j, combine the power of mobile technology 
with crowdsourcing to provide their customers with time- 


sensitive and location-dependent information related to their 
stores and products. For example, in Field Agent [3], there 
are two types of tasks, namely audit and research. An audit 
task mainly involves fact finding and data gathering (such 
as checking on-shelf availability and prices), while a research 
task mainly involves the collection of users’ opinions and 
insights (such as surveys and shop-alongs). For a location- 
dependent and time-sensitive sensing task, such as checking 
the on-shelf availability of Coca-Cola in a convenience store 
on Main Street at 9am, the users need to collect the data 
at the precise time and location. With this information, 
a district manager can reduce the cost of taking invento¬ 
ries, while maintaining the proper stock levels at different 
stores. Currently, several well-known brands and retailers 
(such as Coca-Cola and Johnson & Johnson) are among the 
customers of Gigwalk and Field Agent. This suggests that 
the collection of location-dependent and time-sensitive data 
using MCS is a practice of growing importance. 

A key question for MCS platforms is how to find users 
to complete tasks, while accounting for the users’ different 
initial locations, movement costs, movement speeds, and rep¬ 
utation levels? A number of recent results have focused on 
the centralized task allocation in MCS, with the objectives 
of either improving the energy efficiency or maximizing the 
social surplus. For example, Sheng et al. in |14 | considered 
opportunistic energy-efficient collaborative sensing. Given 
a set of roads, mobile devices, and their trajectories, the 
objective is to find a sensing schedule that minimizes the 
total energy consumption by reducing redundancy in sens¬ 
ing. Zhao et al. in [15] considered fair and energy-efficient 
task allocation in MCS, and solved a min-max aggregate 
sensing time problem. He et al. in [7] considered social sur¬ 
plus maximization for location-dependent task scheduling 
in MCS. They formulated a task scheduling problem, where 
the objective is to maximize the overall net reward of all 
the users, subject to the time budget of each user and the 
sensing redundancy constraint of each sensing task. 

Different from the above literature, which focuses on the 
collection of location-dependent data without any time con¬ 
straints in a centralized fashion, we consider the case where 
the service provider aims to collect time-sensitive and location- 
dependent information for its customers through distributed 
decisions of mobile users. The solution of such a problem 
needs to balance the rewards and movement costs of the 
users for completing tasks. The consideration along both 
space and time dimensions makes the design of a good solu¬ 
tion very challenging. The distributed nature of the solution 



that is required by the current commercial platform such as 
Gigwalk and Field Agent further complicates the analysis. 

In this paper, we focus on solving the distributed time- 
sensitive and location-dependent task selection problem, where 
users are heterogeneous in their initial locations, movement 
costs, movement speeds, and reputation levels. We for¬ 
mulate the interactions among users as a non-cooperative 
task selection game (TSG), and propose an asynchronous 
and distributed task selection (ADTS) algorithm for each 
user to compute her task selection and mobility plan. Each 
user only requires limited information on the aggregate task 
choices of all users, which is publicly available in many of to¬ 
day’s crowdsourcing platforms, such as Amazon Mechanical 
Turk (MTurk) jTj, Gigwalk 0, and Field Agent [3] . 

As a performance benchmark on the social surplus, we for¬ 
mulate a centralized task allocation (CTA) problem, assum¬ 
ing that all users are controlled by the service provider, and 
show that it is an NP-hard problem. We propose a heuristic 
greedy centralized algorithm that computes the approximate 
centralized solution with a lower complexity. 

To the best of our knowledge, this is the first paper that 
considers distributed time-sensitive and location-dependent 
data collection in MCS motivated by real-life commercial 
applications. To summarize, the contributions of our work 
are as follows: 

• Practical MCS modeling-. Motivated by commercial 
MCS applications, we consider the collection of time- 
sensitive and location-dependent sensing data by mul¬ 
tiple users. We assume that the users are heteroge¬ 
neous in their initial locations, movement costs, move¬ 
ment speeds, and reputation levels. 

• Asynchronous and distributed task selection algorithm: 
We propose a distributed algorithm that helps the users 
determine their task selections and mobility plans. Each 
user only needs to know limited information on the 
aggregate task choices readily available in the MCS 
mobile apps. 

• Convergence guaranteed: We show that the task selec¬ 
tion game has the finite improvement property, which 
means that users’ asynchronous best response updates 
globally converge to a Nash equilibrium. We also show 
that each best response update can be computed in 
polynomial time. 



|Cj Mobile User (Low reputation) Audit 


Figure 1: In a MCS platform, we assume that the 
service provider collects location-dependent time- 
sensitive information with the help of the users. 
Through the mobile app interface, the service 
provider would announce the rewards, locations, 
and execution times of the tasks to the users. Each 
user then decides how to move and which tasks to 
work on over the coming time slots. 


be the set of time slots. Let 1C = {1 ,... ,K} be the set of 
tasks. We describe the attributes related to a task as follows. 

Definition 1. (Task Attributes) Each task k € K . is 
associated with: 

• A location l[k] £ £ where the task must be performed. 

• A time t[k] € T when the task must be executecQ. 

• A reward amount p[k\ > 0 for performing the task. In 
the case where multiple users perform the same task, 
the reward is shared evenly among them [2]. 

For example, for task 5 in Fig. [T] we have Z[5] = 12, f[5] = 
10, and p[5] = $5. Notice that there can be multiple tasks 
at the same location, such as Z[6] = l[ 7] = 15. 

Let X = {1,...,/} be the set of users, who can move 
through space and perform tasks. Given a user’s initial loca¬ 
tion, the set of tasks that she chooses to work on is influenced 
by her movement time, movement cost, and reputation. For¬ 
mally, we define a user’s attributes as follows. 


• Balanced performance: Simulation results suggest that 
the proposed asynchronous and distributed task selec¬ 
tion scheme achieves the best performance in terms of 
fairness and coverage comparing with various bench¬ 
mark algorithms. 

2. SYSTEM MODEL 

We consider a mobile crowdsensing (MCS) platform that 
involves the collection of time-sensitive and location-dependent 
information in the form of photos, video, audio, data, opin¬ 
ions, and feedback. In the platform, once a user has installed 
the mobile app on her smartphone, she will be able to use 
the built-in map to check the available tasks, together with 
the task attributes including locations, execution times, and 
rewards, as shown in Fig. [I] 

More specifically, let £ = {1,..., L} be the set of locations, 
such as buildings, landmarks, or fields. Let T = {1,. .. ,T} 


Definition 2. (User Attributes) Each user i £ I is as¬ 
sociated with: 

• An initial location Z) mt € £. (For example, we have 
1^*6 in Fig. [U) 

• An (integer valued) movement time A\' 1 > 1, which 
equals the number of time slots it takes user i to move 
from location l to location l'. 

• A (real valued) movement cost c\' 1 > 0 for user i to 
move from location l to location l'. 

• A non-empty set /C; C 1C of tasks that user i is eligible 
for. 

x Each task k only takes one time slot to execute, and it 
cannot be executed before the corresponding time t[k]. 








If a user has specified her mode of transportation (e.g., 
walking, cycling, driving, or taking the bus), then her mo¬ 
bile device can retrieve data from Google Maps and local 
public transportation databases to compute her movement 
time and movement cost. For example, user i may have a 
zero cost (i.e., c-’ = 0) for walking, and a linear cost for 

driving. 

The set of eligible tasks Ki often relates to the reputation 
of user i. Reputation mechanisms are commonly found in 
crowdsensing applications, such as Gigwalk [6] and MTurk 
[Tj, to allocate tasks based on users’ past performance. A 
service provider can reserve the more complicated but higher 
paying tasks (e.g., research tasks) for users with higher rep¬ 
utations. For example, in Fig. [I] the lower reputation user 
1 can only work on the audit tasks, so /Ci = {1,3,4, 5, 7}; 
whereas the higher reputation user 2 can work on both the 
audit and research tasks, so JC 2 « {1,..., 7}. 

Given the rewards, locations, and execution times of dif¬ 
ferent tasks, each user needs to decide how she should move 
and which tasks she should select in order to maximize her 
total profit (i.e., reward minus movement cost). In Section^ 
we formulate the task selection of multiple users as a task se¬ 
lection game, and propose an ADTS algorithm. In Section 
[4] we establish a performance benchmark under the ideal 
centralized task allocation scenario. We use simulations to 
evaluate the performance of our proposed algorithms in Sec¬ 
tion [5] and draw conclusions in Section [6] 


3. DISTRIBUTED TASK SELECTION GAMES 

In this section, we formulate the users’ task selection prob¬ 
lem as a task selection game (TSG), where each user aims 
to maximize her own payoff. We also study the equilibria 
and convergence properties of the game. 

3.1 Task-Time Routing 

A user’s decision making includes two aspects. First, she 
must choose which locations to visit over the T time slots 
(i.e., selecting a route through space-time). Second, she 
must choose which tasks to do along that route. However, 
since each task k is only identified with a single location l[k ] 
and time t[k], we can simplify the modeling by just consid¬ 
ering how the users select different tasks at different times 
(i.e., choosing a route through task-time). 

Since a user may choose not to work on any task, for each 
user i we define an initial virtual task fc) nlt £ /Ci, which 
occurs at all time slots t[fc) mt ] £ T and at the same location 
Zffc- nlt l = l) nlt (user V s initial location) with a zero reward 
(i.e., p[fcj nit ] = 0). 

To clearly describe the task-time routing decision prob¬ 
lem, we use a sequences of task-time points, described in 
Definition [3] to represent a user’s task-time route. We say 
that a task-time route is available to a user when it is phys¬ 
ically possible for her to visit the corresponding locations 
over the time slots, and she is eligible to perform each of the 
tasks at those locations. 


Definition 3. (Available Task-Time Route) An avail¬ 
able task-time route for a user i is a sequence 


n = , ( ki,ti ),..., ( ki,ti )) £(Cx Tf 


(1) 


2 We can set the movement cost cf 1 = 00 if a user i is 

unwilling or unable to travel from location l to location l'. 



Figure 2: Task-time routes chosen by two users with 
audit (red circles) and research (blue circle) tasks. 


of n task-time points (for some n > 1) which satisfies the 
following conditions: 

1. Time increases: 1 = t\ < t% < ... < t" < T. 

2. The user is eligible for the tasks: k},h?,... ,k" £ /C;. 

3. Sequence starts at the initial virtual task: k\ = 

4. Sequence accounts for movement time: tf +1 — tf = 

A- , for each a £ {1, ... ,n— 1}. 

Condition 1 accounts for the fact that time is always in¬ 
creasing. Condition 2 ensures that the users are eligible to 
perform each of their chosen tasks. Condition 3 ensures that 
the user begins at her initial location. Condition 4 ensures 
that the time difference between successive elements in the 
sequence of task-time elements is equal to the movement 
time between the locations of the corresponding tasks. (In 
other words, it ensures that the movement time is indeed 
the time they spend moving.) When a user does not move, 
we define A*’ = 1, V i £ X, l £ £ to represent the fact that 
the user stays at the same location after one time slot. 

Based on the example in Fig. |T] by considering task 1 to 
task 4, we show an example of the task-time routes of two 
users in Fig. [5] Here the solid and empty circles represent 
task-time points with and without positive rewards, respec¬ 
tively. When a user’s route passes through a solid circle, it 
means that the user works on the task (e.g., user 2 works on 
task 3 at time 5). On the other hand, when a user’s route 
passes through an empty circle, it means that the user is 
only physically present at the location of a task, but she is 
not working on the task (e.g., user 2 is at location l[ 3] at 
time 3). In this way, a user can move to the location of a 
task before the actual execution time of that task (e.g., user 
2 arrives at location of task 3 at time 3, while the execution 
time of task 3 is at time 5). 

In Fig. [5] we suppose that a low reputation user 1 has 
movement times Aj 1 = 1 and Aj 1 ^® = 3, while a 

l init H4] 

high reputation user 2 has movement times A 2 2 ’ —1, 

Aj 4 !' 1 ! 3 ! = 1, and A^’ 1 ^ = 2. The purple and green curves 
represent the two task-time routes: n = ((fci mt , 1), (1, 2), (3, 5), 
(3,6), (3, 7)) and r 2 = ((fci, ni \ 1), (4, 2), (3, 3), (3,4), (3, 5), (2, 7)). 

3.2 Task Selection Game 






















Based on the discussion aforementioned, we can formulate 
the users’ task selection problem as a TSG, where users act 
as players that choose available task-time routes. 

In the task-time routing framework, changing tasks will of¬ 
ten involve changing locations and hence will involve move¬ 
ment costs. To allow a user to move to the location of a 
task before its execution time, we define the time-dependent 
reward p * [k, t'] for task k £ 1C and time t' £ T as 

pV] = {p[k], ^ *'=*[*]• 

0, otherwise. 

For example, in Fig.© the rewards for task-time points (3, 3) 
and (3,5) are p*[3, 3] = 0 and p*[3, 5] = p[3] = $2, respec¬ 
tively. We assume that the reward p*[k,t'] is evenly shared 
among all the users who have chosen to work on task k at 
time t' (i.e., the task-time point ( k,t ').) 

Definition 4. (Task-time points) Let us define the task¬ 
time points of a task-time route ri in © to be the set V(ri) = 
{(k},tj) , ,..., (fc™,f")}, of all task-time points vis¬ 

ited by the route n £ TZi , where 7 Zi denotes the set of all 
available task-time routes of user i. 


The payoff Ui(r) that user i gets for choosing route ri in 
© in a strategy profile r = (r i, ..., ri ) £ TZ\ x ... x 7 Zi is 
equal to 


Ui(r) 


(P*[k?,tt] \ \ ^ i[fc?],i[fc“ +1 ] 


( 2 ) 


where m^ k,t ^(r) = |{j € X : (k,t) € V(r ;) ')}| is the number 
of routes that pass through task-time point ( k,t ). The first 
term in © corresponds to the total reward that user i ob¬ 
tains (taking into account how the reward is shared evenly 
when multiple users perform the same task). The second 
term in © corresponds to the total movement cost, which 
user i spends in order to travel to the locations of the se¬ 
lected tasks. 


the users except user i. A strategy profile can be written as 
r = (n,r-i). 

Definition 7. (Better and best response updates) A 
better response update, starting from some strategy r = 
(r», r_i), is an event where a single user i changes to an¬ 
other strategy, r\ € TZi, and increases her payoff as a result, 
i.e., Ui(r'i,r~i ) > Ui(n,r-i). 

A best response update is a special type of better response 
update, where the newly selected strategy r[ maximizes user 
i’s payoff among user i’s all possible better response updates. 


Definition 8. (Pure Nash equilibrium) A pure Nash 
equilibrium (NE) is a strategy profile r*, where no user can 
perform a better response update unilaterally. 


Definition 9. (Finite improvement property) A game 
possesses the finite improvement property (FIP) when asyn- 
chronoufl better response updates always converge to a pure 
NE within a finite number of steps, irrespective of the initial 
strategy profile or the users’ updating order. 

Existence of the finite improvement property implies that 
better response updating always leads to pure Nash equilib¬ 
ria, which implies the existence of pure Nash equilibria^ 


Theorem 1. Every task selection game possesses the fi¬ 
nite improvement property. 


The proof of Theorem©is given in Appendix!© We then 
proceed to study how long it takes for a strategy profile to 
converge to a pure NE. Theorem [2] ensures that each best 
response update can be computed in polynomial time. 


Definition 5. (Task selection game) A task selection 

game Cl = (X, £, 7C, T, (p*[M])fce/c,ter, (c , A l f l ) ieI ,iy & c), 
involves each user (player) i choosing an available task-time 
route (strategy) ri £ TZi and receiving the payoff in ©. 


Definition 6. (Task-time point pairs) The task-time point 
pairs E(ri) of a task-time route ri in © is the set E(tt) = 

| [(fcf, f“), (fc“ +1 ,f“ +1 )] : a = 1, — l| of task-time 

point pairs subsequently visited by the route ri. 


Let 9i ([(fcf, tf ), (fc“+\ t a + 1 )] ) = c ‘ [h * 0] - ,Ifc ? +1] be the move¬ 
ment cost between two adjacent task-time points of user i. 
The payoff of user i in © can be rewritten as 


Ui(r) 


\ ^ P*[M] \ 

-Z-/ m (k,t)i r \ I 

(M)ev(r-i) v ’) 


E 9 i(e). (3) 

eGE(rj) 


3.3 Equilibrium Existence and Convergence 
Analysis 

Before analyzing the task selection games, let us recall 
some commonly used definitions from game theory. Let 
r-i = Or,..., Vi—±, .. •, ri) denote the strategies of all 


Theorem 2. A best response update can be computed in 
0{K 3 T 3 ) time. 

The proof of Theorem [5] is given in Appendix!© The proof 
is constructive, as it allows us to compute best response 
updates in polynomial time. Simulations suggest that the 
number of best response updates required to reach a pure 
Nash equilibrium grows linearly with the number of users in 
a wide variety of scenarios, although a theoretical proof is 
quite challenging to obtain. 

3.4 Asynchronous and Distributed Task Selec¬ 
tion Algorithm 

Theorems [T] and [2] guarantee the convergence of our asyn¬ 
chronous and distributed task selection (ADTS) algorithm 
(Algorithm©. To initialize the algorithm, a user inputs her 
private information on the movement cost (line 2), checks 
the task descriptions (line 3), and then computes her move- 

asynchronous updates imply that there will be no two users 
updating their strategies at the same time. 

4 Note that the time steps involved in understanding the con¬ 
vergence of the best response updates is not the same time 
slot that we introduced in Section© 





Algorithm 1 Asynchronous and distributed task selection 
(ADTS) algorithm for user i G X. 


1: Initialization 

2: User Input: Movement cost cf l Wl,l' E £• 

3: Check the task description, location l[k\, time t[k\, and reward 
p[k] for all task k G JC on the mobile app interface. 

4: Calculate movement time A l f l 6 , V Z, Z 7 G jC using Google Maps 
data based on movement speed z/j. 

5: Planning Phase: Task Selection Game 


6: repeat 

7: Check clock timer r on the mobile app. 

8: if r G T; 

9: Check the mobile app for the number of participants 

for all ( k,t ) £ H. 

10: Calculate the number of participants excluding user i 

itself: : = q <>’*) - qf’^, V (fc, t) € H. 

11: Perform a best response update: Find a route r* G 7 Zi 

that maximizes user i’s payoff: 


12 : 


Ui(ri, r—i) : = 


P* [M] 

(M)ev(r-i) Q-i^ + 1 


E 


Update the task selection decision: 


- E 9i ( e )- 

eGE(rj) 




13: Report q ; := (<j^ fc,t , V (fc,t) £ H) to the service provider. 

14: end if 

15: until r > r max . 

16: Data Collection and Sensing Phase 
17: for each user i £ X 

18: Move and complete the sensing task in each time slot t 

based on the task selection plan r,;. 

19: end for 


ment time from Google Maps dat^EI based on her speecQ 
(line 4). 

With the aggregate informatiorQ on the total number of 
users working on different tasks (line 9) provided by the 
service provider in Algorithm [21 each user performs a best 
response update (lines 11 and 12), and claims the tasks by 
sending her updated task selection q i = {q\ k,t \ V (k, t) £ 
H) to the service provider. Here, H = {( k,t ) £ K, x T : 
p* [k, t] > 0} is the set of task-time points that provide a 
positive reward. 

Notice that users only need to claim the tasks that they 
are interested in working on, and do not need to reveal their 
movement plans to the service provider. This will preserve 
the privacy of users. Let Ti be the set of iteration^ during 
which user i updates her task selection strategy. We assume 
that each user updates her strategy distributively and asyn¬ 
chronously until a predefined iteration limit r max . "We set 
r max to be a large enough value such that the ADTS can 

5 Google Maps assume an average walking speed of about 
3 miles/hour for pedestrians. For example, if the Google 
Maps show that movement time between locations l and 1/ 
is 2 mins, then user i with movement speed of 4 miles/hour 

would have a movement time A*’* = 1.5 mins. 

6 Prototype systems, such as BreadCrumbs [12], can track 
the movement speed of the device’s owner. 

7 Such information is often available for users in crowdsourc¬ 
ing platforms. 

s Note that an iteration only takes a fraction of a single time 
slot. 


Algorithm 2 Information Update Algorithm for the Service 
Provider. _ 

1: Initialization 

2: Announce the task description, location Z[fc], time t[fc], and 
reward p[k\ for all task k £ JC on the mobile app interface. 

3: Allocate memory for q^ k,t \ Vi £ X, (fc,t) £ H. 

4: Initialize clock timer r := 1 on the mobile app interface. 

5: Information Update for Task Selection Game in Algorithm [l] 
6: repeat 

7: if task selection update message q i is received from user i 

8: Calculate the number of participants for all ( k,t ) £ H: 

q(M) — £ ie x9i M) . V(fc,f) G H. 

9: Update y f) £ P on the mobile app interface. 

10: end if 

11: Set r := r + 1. 

12: until r > r max . 


converge. Simulations suggest it is enough to set T max > 5 
when I < 30 and K < 10. 

In the data collection and sensing phase of Algorithm [T] 
each user moves around and completes her claimed sensing 
tasks. The service provider would only pay a user, if the 
user has claimed the task in the planning phase, and has 
completed the task with an acceptable quality. As in Field 
Agent 21) f° r quality control, the service provider can em¬ 
ploy techniques such as GPS marking (for location verifica¬ 
tion), time stamping (for time verification), and photo/video 
confirmation. Except the user input (line 2) and sensing ex¬ 
ecution (lines 16-19), all the other computation, communi¬ 
cation, and information checking from maps can be done by 
the mobile app on behalf of the users. 


4. CENTRALIZED TASK ALLOCATION 

In this section, we consider the benchmark centralized 
task allocation (CTA) problem, where the service provider 
seeks to maximize the social surplus in the TSG. We prove 
that the CTA problem is NP-hard. Due to the high com¬ 
plexity of finding the optimal solution, we propose a heuris¬ 
tic greedy centralized algorithm, which turns out to have 
close-to-optimal performance in our simulations. 


4.1 Centralized Task Allocation Problem 

In the CTA benchmark problem, the service provider allo¬ 
cates tasks to the users in order to maximize the social sur¬ 
plus (i.e., users’ total rewards minus total movement costs). 
The social surplus represents the maximum total profit that 
the service provider can generate, under the ideal case that 
all the users are under its direct controfl 

For user i’s given strategy (task-time route choice) n in 
©. we use yi{ri) = 1 to denote that user i works on task k 
under strategy n, and yi(ri) = 0 otherwise. That is, 


Vi in) 


1, if (k, t[k]) £ ri, 
0, otherwise. 


(5) 


Given the users’ strategy profile r, the social surplus is 

surplus(r) = reward(r) — cost(r), (6) 

where reward(r) is the total rewards received by all users, 
and cost(r) is the total movement costs of all users. Since 

9 In this case, each user will get a zero payoff, i.e., the pay¬ 
ment from the service provider equals to the user’s move¬ 
ment costs involved in finishing the tasks. 



















the reward p[k\ is equally shared among all the users working 
on task k, reward(r) equals the total reward of those tasks 
that have at least one assigned user. So we have 

reward (r) = 1 {E ie x v? <r 4 )>i}P[ fc l’ ( 7 ) 

kefc 

where 1{.} is the indicator function. Moreover, we have 

n —1 a+1 

cost(r) = ^cost i (r i ) = 5Z^c' [feil ’ ![fci \ (8) 

iex i£X a=l 


where cost;(r;) = X^a=i cp'^’ 1 ^' ^ is the movement cost of 

user i. 

In practice, it is reasonable to assume that the total move¬ 
ment cost of a user is non-decreasing in the number of loca¬ 
tions she visited. That is, 


i J',J" 

<V + G 


> cY , \/i ex,i,i',i" e c. 


(9) 


With this assumption, we can show that there always ex¬ 
ists an optimal task allocation where each task is allocated 
to at most one user. 


Lemma 1. There always exists a social surplus maximiz¬ 
ing (optimal) task assignment r* = argmax surplus(r) 

T-G71 1 X... x7 Zi 


such that ( r i) < 1 ,VkeJC. 


The proof of Lemma [T] is given in Appendix [C] With 
this lemmeFI. we can establish the NP-hardness of the CTA 
benchmark. 


Theorem 3. The problem of finding the social surplus 
maximization solution of the TSG is NP-hard. 

The proof of Theorem [3] is given in Appendix El Theorem 
[3] motivates us to consider a greedy heuristic algorithm to 
solve the CTA problem. 

4.2 Greedy Centralized Task Allocation Algo¬ 
rithm 

We propose a low complexity greedy centralized (GC) al¬ 
gorithm (Algorithm [3]), which computes an approximate so¬ 
lution to the social surplus maximization problem. The key 
idea is that the service provider sorts the tasks in ascending 
order of the execution time, and then allocates one user to 
each task sequentially in a greedy manner. Specifically, in 
line 6, the service provider first serves tasks with the earliest 
execution time. In line 9, the service provider sorts users in 
the order of an increasing movement costs (from the users’ 
current locations to the location of the current task k ). Line 
12 indicates that a user i will be chosen if it is eligible to 
perform the task (i : k £ ICi), can reach the task location on 
time ( t[k] — 4>i> and has the incentive to perform 

the task (p[k\ > (line 12). Here (pi is the first time 

slot of user i after finishing any existing allocating task. It is 
possible that some task k may not be allocated to any users 
(i.e., y? = 0, Vi€ 1). 


10 Note that there can be other optimal solutions that do not 
satisfy LemmaHowever, these solutions can be obtained 
readily from the op tima l solution in Lemma \l\ as we will 
discuss in Appendix lC.ll 


Algorithm 3 Greedy Centralized (GC) Algorithm for Task 
Allocation. _ 

1: Initialization 

2: Obtain movement costs, speeds, and initial locations from all 
the users: c?. 1 * , V/, l' G £, i G X, Vi and ZJ nit , Vi Gl. 

3: Initialize the next available time and current location of the 
users: pi := 1 and Zj := ZJ mt , Vi Gl. 

4: Initialize the optimization variables y f := 0, Vi G X, k G /C. 

5: Greedy Centralized Task Allocation 

6: Sort the tasks in set /C in the ascending order of the execution 
time t[k\. 

7: for k = 1 to K 
8: Set flag := 0. 

9: Sort the users in set X in the ascending order of c^’^. 

10: Set i := 1. 

11: while i < I and flag = 0 

12: if k G JCi and £[/c] — pi > and p[k] > 

13: Update user z’s next available time pi := t[k] and 

current location li := l[k\. 

14: Indicate the task allocation yk \= \ and set flag := 1. 

15: end if 

16: Set i := i + 1. 

17: end while 

18: end for 
19: for iGl 

20: Inform user i the task allocation (y^ , Vfc E /C). 

21: end for 

22: Data Collection and Sensing Phase 
23: for each user iGl 

24: Move and complete the sensing tasks based on the 

task allocation (z/^, Vfc G /C). 

25: end for 


Table 1: Simulation Parameters 


Parameters 

Values 

Number of tasks K 

10 

Number of time slots T 

15 

Rewards for three levels of tasks 

$10, $15, $20 

Duration of a time slot 8 

1 minute 

Movement speed v 

0.1 km/min 


Lemma 2. Alqorithm {3\ has a computational complexity 
of <D(KI log I). 

The proof of Lemma [2] is given in Appendix [E] 

5. PERFORMANCE EVALUATIONS 

In this section, we evaluate the performance of our pro¬ 
posed ADTS scheme by comparing with the CTA bench¬ 
mark, the GC scheme, and a greedy distributed (GD) scheme. 
We study the impact of various system parameters on the 
average user payoff, fairness, coverage, and average reward 
per measurement on different schemes. We also present an 
example using real movement time data from Google Maps. 
Our simulations involve the following schemes: 

• ADTS scheme: Algorithms [T] and [2] 

• CTA benchmark: Global optimal solution of the social 
surplus maximization problem. 

• GC scheme: Algorithm [3] that approximately solves 
the social surplus maximization problem. 

• GD scheme: Each user chooses to work on the earliest 
feasible task without coordinating with other users. 






















Figure 3: The average user pay¬ 
off versus the total number of 
users I for K = 10 and c move = 0.1. 


Figure 4: The average user pay¬ 
off versus the total number of 
users I for K = 5 and c move = 1. 


Figure 5: Jain’s fairness index 
versus the total number of users 
I for K = 10 and c move = 0.1. 





Figure 6: The coverage versus 
the total number of users / for 

K = 10 and c move = 0.1. 


Figure 7: The average reward 
per measurement versus the to¬ 
tal number of users I for K = 10 
and c move = 0.1. 


Figure 8: The average user pay¬ 
off versus the movement cost co¬ 
efficient c move for I = 7 and K = 
5. 


Like the GC scheme, user i should be eligible for the 
task (i.e., i : k G ICi), can arrive on time to perform the 
task (i.e., when t[k] — (f>i> A* 4,i ^ given that user i is 
at location U), and has an incentive to perform sensing 
assuming no other users share the reward of that task 
(i.e., p[k] > c^ W ). If there is more than one feasible 
task with the same execution time, then the user will 
choose the task with the highest reward. 

For each set of system parameter choices, we run the 
simulations 1000 times with randomized initial user loca¬ 
tions, and randomized locations and times for the tasks, 
and show the average value. Unless specified otherwise, we 
assume that the locations of the K tasks and I users are 
randomly placed in a 1 km x 1 km region. Each user is 
randomly allocated one of three reputation levels, and each 
task is randomly assigned a threshold. A user is eligible 
to perform a task when her reputation exceeds the thresh¬ 
old of the task. Each user i moves at constant speed Vi, 
and the movement time from location l £ C to location 
l' G L is A*’* = ^ j , where dist (1,1') is the travel¬ 

ing distance on the road between locations l and l', 5 is the 
length of a time slot, and ["•] is the ceiling function. We 
assume that the movement cost of user i is linearly pro¬ 
portional to the distance between two locations in the form 
c 1 / = cf love dist(l, l'). For simplicity, we assume that all the 
users have the same movement cost coefficient cf 10 ™ = c move 


and the same movement speed Vi = v. Other simulation pa¬ 
rameters are listed in Table [l] 

5.1 Average User Payoff and Fairness 

Impact of user number on user payoff: In Fig.[3l we 

plot the average user payoff against the number of users /, 
with K = 10 tasks and movement cost coefficient c move = 
0.1. The average user payoff decreases with I under all three 
schemes, due to the increased competition with the increas¬ 
ing number of users. The ADTS scheme achieves a similar 
average user payoff as the GC scheme. 

Comparison with centralized optimal solution: In 
Fig. U we plot the average user payoff against I in a smaller 
scale, with K = 5 and c move = 1, so that we can evaluate the 
CTA benchmark as well. Note that GC achieves a similar 
payoff to the benchmark, especially when I is large. 

Fairness: In Fig. [5] we study how the payoffs are dis¬ 
tributed among the users by plotting Jain’s fairness index [5] 
(defined as Ui ( r )) 2 /(I u i ( r ) 2 )) against /, with 

K = 10 and c move = 0.1. The ADTS scheme has the highest 
fairness index, as each user has an equal chance to update 
her strategy profile in ADTS in Algorithm [I] On the other 
hand, the GC scheme has the lowest fairness index, as it 
allocates K tasks to at most K users. As a result, when 
I increases, the number of users not allocated any task in¬ 
creases, so the fairness decreases. 

5.2 Coverage and Reward Per Measurement 













































Figure 9: The coverage ver¬ 
sus the movement cost coef¬ 
ficient c move for I = 10 and 

K = 10. 


Figure 10: The coverage versus 
the movement speed v for I — 10 
and K = 10. 


Figure 11: An illustration on the 
task selection and mobility plans 
of the users on the map under 
the ADTS scheme. 


Table 2: Movement Time Data from Google Maps 


Drive/Walk (min) 

Location 1 

Location 2 

Location 3 

Location 1 

0/0 

3/11 

4/15 

Location 2 

3/11 

0/0 

2/6 

Location 3 

4/15 

2/6 

0/0 


We examine the coverage and average reward per mea¬ 
surement under the three schemes, with K = 10 tasks and 
c m ° ve = 0.1. The coverage is defined as the percentage of 
tasks with at least one measurement received. 

Impact of user number on coverage: In Fig. [6] we 
plot the coverage against I. When I increases, we see the 
coverage under all the schemes increases, as there are more 
users to work on the tasks. The ADTS scheme has a slightly 
higher coverage than the GC scheme, as users try to avoid 
too much overlapping with each other in ADTS. The large 
difference between the coverage of the ADTS and GD schemes 
is due to the lack of coordination in the GD scheme. 

Average reward per measurement: Fig. [7] shows the 
average reward per measurement against I. We can see that 
the ADTS and GD schemes have the same smaller aver¬ 
age reward per measurement than the centralized scheme 

GC. From a service provider’s point of view, with the same 
amount of total reward, lower average reward per measure¬ 
ment means that more sensing data can be collected, which 
is beneficial for the service provider. 

5.3 Impact of Moving Cost and Speed 

Impact of movement cost on user payoff: In Fig. 0 
we plot the average user payoff against c move with I — 7 
and K = 5. As c move increases, the payoffs for collecting 
measurements under both ADTS and GC decrease. More¬ 
over, since users under both ADTS and GC are aware of 
the actual reward that they can receive (after sharing with 
other users), the users would not receive a negative pay¬ 
off for collecting data. However, for the GD scheme without 
any user coordination, a user makes decisions with the belief 
that she will gain the full reward p[k] for task k, rather than 
the actual reward she can achieve (after sharing with other 
users). When c move < 4, users become overly aggressive in 

GD, and it becomes common that multiple users work on 


Table 3: Summary of Performance in the Real-world 
Example 



ADTS 

GC and CTA 

benchmark 

GD 

Average user payoff 

$10.33 

$n 

$7 

Jain’s fairness index 

0.93 

0.64 

0.93 

Coverage 

100% 

100% 

66.67% 


the same task. As a result, the actual reward per user is 
much less than the full reward of the task, which results in 
the decrease of the average payoff (even to a negative value) 
in GD. However, when c move increases beyond 4, users are 
less likely to work on the same task in GD, so the evaluation 
of the reward is more accurate, and the average user payoff 
in GD gradually converges to a non-negative value, similar 
to those under the ADTS and GC schemes. 

Impact of movement cost on coverage: In Fig. [9] 
we plot the coverage against the movement cost coefficient 
c move with I = 10 and K = 10. As c move increases, the cov¬ 
erage decreases in general, because the users are less willing 
to perform sensing. The slight increase in coverage in GD 
for c move < 2 is due to the inaccurate reward estimation 
mentioned in the previous paragraph. 

Impact of movement speed on coverage: In Fig. 1101 
we plot the coverage against the movement speed v for I = 
10 and K = 10. As v increases, the coverages under all three 
schemes increases. The ADTS scheme has a slightly larger 
coverage than the GC scheme. 

5.4 A Real-World Example Using Google Maps 

To better illustrate the schemes, we next introduce a more 
concrete example based on the neighborhood map in New 
York City with real data from Google Maps. We consider 
I — 3 users and K = 3 tasks. The locations of the users, 
tasks, the execution times, and rewards are shown in Fig. 1111 
We assume that user 1 prefers driving, while users 2 and 3 
prefer walking. We compute the movement time of driv¬ 
ing and walking between different locations through Google 
Maps, as shown in Table [2] For the movement cost, we as¬ 
sume that driving between any two different locations costs 
$2 and walking is free. 





































Fig. IIII shows the task selection plans of three users (i.e., 
the blue, red, and purple broken lines with arrows) under 
the ADTS scheme. As we can see, user 1 takes the task 
selection plan of task 3 —» task 2, while users 2 and 3 take 
the task selection plan of task 1 —> task 2. The intuition 
is that as users 2 and 3 are pedestrians, who do not have 
enough time to go to the far away location 3 from their initial 
locations. As a result, user 1, who is driving, prefers to take 
task 3 with the minimum competition, and enjoys a higher 
reward. After finishing their first chosen task, all three users 
have enough time to work on task 2, and share the reward 
of $10. In contrast, under both the GC scheme and CTA 
benchmark (not shown in the figure), user 1 works on task 3, 
user 2 works on both tasks 1 and 2, and user 3 does not have 
any task to work on. For the GD scheme (not shown in the 
figure), since the initial locations of all the users are nearby, 
their task selection plans are identical: task 1 —> task 2, 
as everyone chooses to pursue the earliest task without any 
coordination. The performance of the three schemes are 
summarized in Table [3] We can see that the ADTS scheme 
achieves a high level of fairness, coverage, and user payoff. 

6. CONCLUSION 

Motivated by commercial mobile crowdsensing applica¬ 
tions, such as Gigwalk and Field Agent, we studied the 
distributed time-sensitive and location-dependent task se¬ 
lection problem in this paper. We examined the problem 
from the perspectives of both noncooperative game and cen¬ 
tralized optimization. We proposed an asynchronous and 
distributed task selection (ADTS) algorithm to help users 
determine their task selection and mobility plans in a dis¬ 
tributed fashion, based on limited information on users’ ag¬ 
gregate task choices. We proved that finding the social sur¬ 
plus maximization solution is NP-hard. Simulation results 
showed that the ADTS scheme achieves the highest Jain’s 
fairness index and coverage as compared with two heuristic 
schemes. In this paper, we focus on the distributed task se¬ 
lections of a fixed set of users. In the future work, we will 
consider the case a changing user population, where users 
can dynamically join and leave the system. 


every finite game with a potential function has the FIP. Since 
the TSG is a finite game, we prove the statement in Theorem 

in 

To show that <f> (r) is a potential function, we shall sepa¬ 
rate our sums into two parts. 


^ (r ) = ^2 p k,t 

( k,t)eKxT 9=1 5 


and 


( 10 ) 


( 11 ) 


( r ) = E E 

iex eeE(ri) 

Adding equations Col) and m gives 

$ (r) = $> v (r) + (r). (12) 

Also, for a subset S C /C x T of task-time points, let us 
define 
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Clearly &]cxT ( r ) = < U ( r ) ■ Let us define 
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and 
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By comparing this with equation © one can see that 
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Adding equations m and m gives 


7. ACKNOWLEDGMENT 


Ui ( r ) = UY (r) + Uf ( r ). (17) 


This work is supported by University of Macau Grant 
MYRG2014-00140-FST. This work is also supported by the 
General Research Funds (Project Number CUHK 412713 
and 14202814) established under the University Grant Com¬ 
mittee of the Hong Kong Special Administrative Region, 
China. 


Suppose that our game is in strategy profile r = (rq,.., 77 ) 
and then user j G I changes their strategy to r'j € IZj , and 
so we have a new strategy profile r' = (r'j, r_j) . In order to 
show that 4> is an exact potential function we will show 

— $ (r) =Uj (r',r_j) -Uj(r). (18) 


APPENDIX 

A. PROOF OF THEOREM H 

The key idea is to show that the mapping 


J) (r) 


E 


E 


i(fe,t)e/Cxr 9=1 


p* imA 
9 ) 


-EE 9i{e) 

ieX eGE (r^ 


is a potential function on of the TSG (where E(r^) is defined 
in Definition E3). This means that for each strategy profile 
r — (n,. .., n) G TZi x ... x Til, each user i€l, and each 
strategy r' G 77.; available to user i, we have $ (r',r_;) — 
$ (r) = Ui ( r'i,r-i) - Ui (r). In [IT], the authors prove that 


To begin, note that we can expand out the left hand side 
of equation m using equation m to obtain: 

$ ( r 'j, r -j) -$(r) 

= [ $V (r)] + (r)] . 

(19) 


Equation (1161) implies that 


^ (r) = ^ \uf - Uf (r)l 

iex (20) 

= Uf (r'j,r-j) - Uf (r ). 






Using equation m- we can obtain that 

-$ V ( r) 

= [ < ^KxT-(v(r ;| -)u¥(r'.)) { r 3’ r ~j) — ® K. X T- (v(r-j ) UV(rJ )) ( r )] 
+ [®v(rj)nV(rj) ( r t’ r -t) _ ^v(r-j)nV(r') ( r )] 

+ [ $ V(r')-V(rj) { r i’ r -i) ~ ( r )] 

+ [^*v(rj) — v(rt) ( r i’ r -t) - ®v(r 3 )-V(r') M] ' 

( 21 ) 

Now since m < ' k ’ t ' > (r) = m^ k ' t ' > (r^, r_j) holds true for every 
(M) £K.xT- (V(rj)UV(r')) or (fc,t) eV(r 3 )nV(r'j 
we have that 


Substituting equations (1231) and (1241) into equation 
gives us that 


i r 'j’ r -j) - $V ( r ) 
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P* [&, i] 
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Substituting equations and (l25)l into equation 09, 
gives us that 

$ ( r t> r -l) - 

= [^7 W^-i) - ^7 ( r )] + [^f ( r 7 r -i) - ^7 0)] ■ 

(26) 


[^KxT-(v(rj)uv(rj)) { r ji r ~j) ®(CxT-(v(rj)u¥(r')) ( r )] 
+ [ < ^ v ( r j) nv ( r j) ( r t’ T ’-f) — ^ > V(r ; j)nV(rt) M] = 

and so equation m can be simplified to give 


$ V ( r 'j’ r -i) - ( r ) 

= [ $ v(r))- V b) ~ ®V(rt)-V(r 3 ) ( r )] ( 22 ) 

+ [ $ V(r^)-V(rJ) ( r J’ r ~y) - K(rj)-V(r') ( r )] • 


Using equation urn together with the fact that mS k,t ^ (r'-, r_j) 
m ( fc 4) \/(fe, t) £ V (r'j ) — V ( 77 ) gives us that 

^ > V(r')-¥(r- 3 -) ( r i’ r -i) — ^ > V(r-')-y(r 3 ) M 

= E 
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Similarly, using equation 09, together with the fact that 
m (M) — mf- k ’ t ' > (r) — 1, V(fc,t) € V(r,) — V (r') 

gives us that 
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(24) 


Now equation C3 can be used to simplify the right hand 
side of equation (1261) . and this yields the right hand side of 
equation m, as required. ■ 


B. PROOF OF THEOREM H 

The key proof idea is to show that the problem of a generic 
user j finding a best response strategy in a strategy profile 
r can be reformulated as the problem of finding a longest 
path in a weighted directed acyclic graph Q *, which can be 
solved in polynomial time. 

To construct such a graph Q *, we construct a weighted 
directed graph Q = (V, £). Recall that V is defined in Defi- 
nition[4]and E is defined in Definition[G] The vertex set V = 
U S 67 j V (s) corresponds to the set of task-time points ( k , t), 
ayhich lie on routes which are feasible for user j. The directed 
edge set £ = UseTC ® ( s ) corresponds to the set of all pairs 
[(k, t ), (k',t')\ € E (s) such that user j can move between. 
Each directed edge e = [(fc, t), (k', T)] of the graph Q is as¬ 
sociated with an edge weight w (e) = —gj{e) = — c l J k ^’ l ^ k \ 
Each vertex ( k , t ) is associated with a value 


*((M))» 


p* [k, t] 


\{i&T.i+ j, ( k , t) G V (ri)}| + 1 ’ 


(27) 


where |{* € X : * j, (k, t) € Y (n)}| equals the number of 
users other than j which have selected task-time routes that 
pass through ( k,t ) in the strategy profile r. Notice that 
8 ((fc, t)) equals the reward that user j would get for visiting 
task-time point ( k,t ). 

A path in a directed graph is a sequence of vertices, where 
each vertex is linked to its successor in the sequence. A best 
response for user j corresponds to a path in the graph Q, 
which starts at the task-time point (fcj mt , 1 ), and maximizes 
the sum of the weights of the edges, and the values of the 
vertices along the path. 

Starting from graph Q defined above, we form a new graph 
Q * = (V*,£*), by replacing each vertex ( k,t ) in Q with 
a pair of new vertices (k,t, 0) and (k,t, 1). These two new 
vertices are linked by a new directed edge [(k, t, 0 ), ( k , t, 1 )] , 
which has an edge weight w* ([(k,t, 0 ), (k,t, 1 )]) = 8((k,t)) 
as in (EH). equal to the value of the original vertex ( k , t). 

As a result, graph Q* has a vertex set 


V* = {(fc, t, 7) : (k, t) £ V, 7 € {0,1}} , 

























and an edge set £* = £ j" U • Here set 

£i = {[(M,l), (fc',*', 0)] : [(k,t),(k',t')] G £}, 

and each edge [(fe, t, 1), ( k ', t', 0 )] € £f has a weight 

w* ([(k,t,l),(k',t',0)]) = w([(k,t),{k',t')}) , 

with the right hand side being the weight of the edge origi¬ 
nally in graph Q. Also set = {[(fc, t, 0), (fc, t, 1)] : (k, t) G V} , 
and each edge [(fe, t, 0), ( k , t, 1)] G has a weight 

w* ([(k,t,l),{k',t',0)]) = 6 >((M)), 

where the right hand side follows m- 

By considering the weights of all edges in Q* as lengths, 
one can see that a best response for user j in strategy pro¬ 
file r corresponds to a longest path in Q * that starts at the 
vertex (fc™ f , t, 0). Such a longest path can be computed in 
<D(|V*| x |£*|) = O (K 3 T 3 ) time using the algorithm de¬ 
scribed in m, because Q* is a directed acyclic graph. Since 
Q* itself can be constructed in O ( K 2 T 2 ) time based on the 
previous discussions, the entire process for a user to compute 
the best response has a complexity of O (^K 3 T 3 'j . ■ 


C. PROOF OF LEMMA H 

We prove the lemma by contradiction. Assume that for 
any optimal solution r * , we can always find a task q G K . 
such that Vi{ r i) > 1- Let us focus on a particular user 

j € I that is assigned to work on task q (i.e., j/|(r*) = 1 ). 
We can define another strategy profile r , which is the same 
as r * for the task allocation of all users, except that task q 
is not allocated to user j. That is, 


Vi in) 


0 , if k = q and i = j, 

Vi ( r *), otherwise. 


(28) 


Since the coverage of the tasks under r is the same as r *, 
where J2 ie x vH r i) > 1 and J2i £X yH r *) > 1 for each task 
k G 7C, we have from 0 that 

reward(r) = reward(r*). (29) 


Since user j works on one task less in rj than in r*. from 
©> we have 


As an example, consider the case with two users (i.e., I = 
2) and three tasks (i.e., K = 3). For the execution time of 
the tasks, we assume that f[l] = 2, t[ 2] = 3, and t[3] = 4. 
For the locations of tasks, we assume that the three tasks 
are located on a straight line, where the movement cost of 
user 1 satisfy the condition 


i „ 2 . 3 A . 3 

+ c i =<h. ■ 


(33) 


That is, the location l[ 2] of task 2 is between the locations 
Z[l] and l[ 3] of tasks 1 and 3. 

Assuming that the strategy profile with 

n = ((fcl nit , 1 ), ( 1 , 2 ), (3,4)) 

and 


r 2 = ((^ nit ,l),(2,2),(2,3),(2,4)) 

is an optimal solution of the social surplus maximization 
problem, which satisfies the condition in Lemma 1. From 
this strategy profile, we can define another strategy profile 

fi = ((M nit , 1), (1,2), (2, 3), (3,4)) 

and 


r 2 = ((fcj, nit , 1), (2,2), (2,3), (2,4)), 

where both users work on task 2 at time 3. Notice that the 
social surplus achieved under this strategy profile r is the 
same as the strategy profile r, since user 1 does not need to 
incur any extra movement cost in completing task 2 . 


D. PROOF OF THEOREM 0 

We prove the NP-hardness by restriction [5]: We show 
that finding the social surplus maximization solution in a 
special case of a TSG can be transformed into a 3-dimensional 
matching decision problem, which is NP-complete [5][9]. 

Definition 10. (3-dimensional matching) Let X, y, and 
Z be three finite disjoint sets. Let RClfxJx^bea set 
of ordered triples, i.e., 7 Z = {(x,y, z) : x G X,y G y,z G Z}. 
1Z 1 C TZ is a 3-dimensional matching if for any two different 
triples (xi,yi,zi) G 7 Z' and (x2,V2,Z2) G 7 Z', we have x\ ^ 
X2, yi 2 / 2 , and zi ± Z 2 ■ 


costj(rj) < costj(r*). (30) 

However, the movement costs of other users remain the same 
such that 

costi(ri) = costj(r*), Vi G (31) 

Overall, substituting 03), 03, and m into ©. we have 

surplus(r*) < surplus(r), (32) 

which means that strategy profile r is also an optimal so¬ 
lution. This leads to a contradiction, and hence proves the 
theorem. ■ 

C.l Discussion of multiple solutions of the so¬ 

cial surplus maximization problem 

We note that there can be other optimal solutions that 
do not satisfy Lemma 1. However, these solutions can be 
obtained from the optimal solution in Lemma 1, when there 
are no extra movement costs for users to work on some tasks, 
so that there can be more than one user working on a par¬ 
ticular task in the optimal solution. 


Definition 11. (3-dimensional matching decision prob¬ 
lem) Suppose \X\ = | y | = \Z\ = M. Given an input 1Z 
with \TZ\ > M, decide whether there exists a 3-dimensional 
matching TZ’ C 7Z with the maximum size \1Z'\ = M. 

Consider a restricted TSG, which is a special case of a 
TSG, which we place the following restrictions: 

(a) Tasks and time slots: There are T — 3 time slots, 
and there are M tasks in each time slot (hence a total of 
3 M tasks in 3 time slots). Sets X, y, and Z represent 
the sets of available tasks in the three time slots, where 

\x\ = \y\ = \z\ = M. 

(b) Task-time route: Set 7 Z represents the set of available 
task-time routes of all the users. Assume that each user i can 
only choose one particular available task-time route r,; G 1Z. 
We assume that the number of users I is large enough, such 
that the task-time routes of all the users cover all the tasks 
in X U y U Z, so \1Z\ = I > M. 1Z' C 1Z in Definition [TT] 
represents a feasible task allocation. We assume that a user, 
whose route is not chosen in 1Z', will not work on any real 
task. 



Figure 12: An example of 3-dimensional matching. 
Here, the set IZ consists of the four grey areas, which 
represent the routes of users 1 to 4. The solution of 
the 3-dimensional matching problem is the set IZ' 
that consists of the routes of users 1, 2, and 4. It is 
also the solution of the social surplus maximization 
problem. 

(c) Large reward: The reward of a task is larger than the 
movement cost to work on the task, i.e., p[k] > c} k 1 ^ for 
each user i £ X and all tasks k, k' £ 1C. 

(d) User-independent movement cost: The movement cost 
only depends on its destination location and is independent 
of its initial location, such that c\’ 1 = c[l'] for all i £ X 
and l, l' £ C, where c[l’\ is the movement cost of destination 
l' £ C. 

In the restricted TSG, first, restrictions (c) and (d) im¬ 
ply that we can maximize the social surplus by covering 
all the tasks with any available users. Second, Lemma [I] 
(and the discussion in Appendix 1C.II) implies that we can 
focus on an optimal solution, where each task should be al¬ 
located to at most one user. So the optimal task allocation 
should not contain any overlapping components (i.e., multi¬ 
ple users working on the same task) as defined in Definition 
1101 Putting the above discussions together, we know that 
in the social surplus maximization solution, every element 
of X x y x Z (i.e., every task) should be contained in ex¬ 
actly one of the triples (i.e., the task-time routes) in IZ'. 
In other words, IZ' C IZ is the optimal task allocation. So 
the surplus maximization problem can be transformed to 
a 3-dimensional matching decision problem, which is NP- 
complete 00. By restriction, we establish that the prob¬ 
lem of finding the social surplus maximization solution of 
the TSG is NP-hard. ■ 
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E. PROOF OF FEMMA |2 ] 

The sorting of execution time in line 6 takes 0(K log K) 
time 0. For each task, besides some simple computations, 
most of the time is consumed for sorting the movement 
costs of the users in line 9, and it takes a running time of 
0(1 log/). Since we generally have more users than tasks, 
i.e., /log/ > log A, the total run time with K tasks is 
0(KI\ogI). U 
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